LLM Optimization

LLM optimization techniques

distillation

quantization

pruning
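
The three techniques listed above can each be illustrated with a toy sketch. The code below is a minimal illustration on plain Python lists, not how any particular framework implements them. All function names (`quantize_int8`, `magnitude_prune`, `distillation_loss`) are hypothetical, and the specific variants are assumptions chosen for brevity: symmetric per-tensor int8 quantization, unstructured magnitude pruning, and a temperature-softened KL divergence as the distillation loss.

```python
import math


def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization (assumed variant).

    Maps floats to integers in [-127, 127] using a single scale factor.
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the quantized integers."""
    return [v * scale for v in q]


def magnitude_prune(weights, sparsity):
    """Unstructured magnitude pruning (assumed variant).

    Zeroes the fraction `sparsity` of weights with the smallest magnitude.
    """
    k = int(len(weights) * sparsity)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    drop = set(order[:k])
    return [0.0 if i in drop else w for i, w in enumerate(weights)]


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    This is one common form of the distillation objective; real training
    setups usually combine it with a loss on the ground-truth labels.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2


if __name__ == "__main__":
    w = [0.5, -1.27, 0.01, 0.9]  # made-up weights for illustration
    q, scale = quantize_int8(w)
    print("quantized:", q, "scale:", scale)
    print("dequantized:", dequantize(q, scale))
    print("pruned (50%):", magnitude_prune(w, 0.5))
    print("distillation loss:", distillation_loss([1.0, 0.2, -0.5], [2.0, 0.1, -1.0]))
```

In this sketch, quantization trades precision for storage (each weight is stored as one small integer plus a shared scale), pruning trades accuracy for sparsity, and distillation trains a smaller student to match a larger teacher's output distribution.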